Fast Distributed Algorithm for Replication in Unstructured P2P Networks

نویسندگان

  • Faraz Makari Manshadi
  • Mauro Sozio
چکیده

Replicating data in distributed systems is an important issue, which is often needed for reliability, availability and performance improvement. In unstructured Peer-to-Peer networks, in which random walks are utilized for query routing, replicating data items can improve the probability of successfully finding requested items. In such networks, nodes often donate resources, in particular a part of their storage, to improve the overall performance of the network. Taking into consideration limited storage capacity that each node possesses, it is significant to replicate items with highest ”worthiness”. The ”worthiness” of data items measures the popularity of each data item for each peer. Considering peer specific request rate of each data item, network topology, allocation of items to peers, we investigate the problem of computing the worthiness of each data item. In this thesis, we present a fast distributed algorithm for computing the worthiness of data items. We first propose an algorithm for this problem in a centralized setting (i.e. with complete knowledge of the network) and then turn the centralized algorithm into a fast distributed one exploiting the local information of each individual peer. Simulation results also verify better performance of our algorithm compared to the algorithm proposed in [SNW08]. As a result, applying our fast distributed algorithm, a performance improvement of the distributed replication algorithm (P2R2) presented in [SNW08] is achieved.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Replication in Unstructured Peer-to-Peer Networks with Availability Constraints

Random walks have been proven a scalable search strategy for unstructured peerto-peer (P2P) networks. In a random walk, a query message is forwarded at each step to a randomly chosen neighbor for a limited number of hops. If random walks are employed as a search strategy, data replication is an important method to increase the probability of successful search. In this context, we investigate th...

متن کامل

ProFID: Practical Frequent Item Set Discovery in Peer-to-Peer Networks

This study addresses the problem of discovering frequent items in unstructured P2P networks. This problem is relevant for several distributed services such as cache management, data replication, sensor networks and security. We make three contributions to the current state of the art. First, we propose a fully distributed Protocol for Frequent Item Set Discovery (ProFID) where the result is pro...

متن کامل

Survey of Search and Replication Schemes in Unstructured P2P Networks

P2P computing lifts taxing issues in various areas of computer science. The largely used decentralized unstructured P2P systems are ad hoc in nature and present a number of research challenges. In this paper, we provide a comprehensive theoretical survey of various state-of-the-art search and replication schemes in unstructured P2P networks for file-sharing applications. The classifications of ...

متن کامل

Gossip-Based Reputation Management for Unstructured Peer-to-Peer Networks*

To build an efficient reputation system for peer-to-peer (P2P) networks, we need fast mechanisms to aggregate peer evaluations and to disseminate updated scores to a large number of peer nodes. Unfortunately, unstructured P2P networks are short of secure hashing and fast lookup mechanisms as in structured P2P systems like the DHT-based Chord. In light of this shortcoming, we propose a gossiping...

متن کامل

A Survey of Dynamic Replication Strategies for Improving Response Time in Data Grid Environment

Large-scale data management is a critical problem in a distributed system such as cloud,P2P system, World Wide Web (WWW), and Data Grid. One of the effective solutions is data replicationtechnique, which efficiently reduces the cost of communication and improves the data reliability andresponse time. Various replication methods can be proposed depending on when, where, and howreplicas are gener...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008